Generalized guard-zone algorithm (GGA) for learning: automatic selection of threshold

نویسندگان

  • Amita Pal
  • Sankar K. Pal
چکیده

Absrract-This work IS a continuation of our earlier work on the Generalized Guard-zones Algorithm (GGA) for self-supervised parameter learning. An atlempt is made here for the automatic determination of the guard-zone parameter i' N (i.e. the threshold used for discarding doubtful or mislabeled samples) at every instant of learning, for the general m-elass N-featu re pattern recognition problem. This is done by minimizing the mean squared error (MSE) of the estimate, under a simple probabilistic model which takes into consideration the presence of mislabeled training samples. Under the assumptions of normality, it is found that Ihe estimates for i," so obtained are distribution-free, that is, they do not depend on the parameters of the distribution. They are functions of N, the iteration number n and certain percentage points or the beta distribution with parameters Nand TIN. The effectiveness of the automatic selection of guard-zone dimension is further demonstrated on a bivariate three-class data set to show the improvement in performance of the GGA. GGA Learning Optimum dimension/threshold 1. INTRODUCTION automatically based on mean squared error (MSE). The explicit expressions for the MSE are obtained for A Generalized Guard-lOne Algorithm (GGA) was both the GGA and the non-GGA (i.e. the usual described by Pathak and Pal(l) for learning class unsupervised stochastic approximation learning parameters using a restricted updating program, algorithm not based on guard-zone) using the model together with investigation of its stochastic conver­ ofChittineni,16l involving mislabeled training samples. gence for optimum learning. Basically, the aim of the An approximation for the guard-zone parameter An GGA is to detect mislabeled training samples and was obtained for which the MSE for the GGA is outliers and to reject them from the parameter updat­ smaller than that for the non-GGA. In other words, ing procedure. The algorithm is a generalization of the value of i'N selected automatically by the system some existing ones(2.J) which were found to be useful makes the GGA discard the doubtful (mislabeled) for practical data but without mathematical formu­ samples from the training procedure, thus improving lation of their various features (e.g. con vergence, its performance visit -vis the non-GGA for self-super­ optimum dimension for guard zone/threshold, per­ vised learning. formance in presence of mislabeling, etc.) This feature is further exemplified with the help of Recently, it was reported(4) that the guard zone a two-feature three-class normally distributed data parameter (An) of GGA lies between certain bounds set. and …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Two-Threshold Guard Channel Scheme for Minimizing Blocking Probability in Communication Networks

In this paper, we consider the call admission problem in cellular network with two classes of voice users. In the first part of paper, we introduce a two-threshold guard channel policy and study its limiting behavior under the stationary traffic. Then we give an algorithm for finding the optimal number of guard channels. In the second part of this paper, we give an algorithm, which minimizes th...

متن کامل

Stock Price Prediction using Machine Learning and Swarm Intelligence

Background and Objectives: Stock price prediction has become one of the interesting and also challenging topics for researchers in the past few years. Due to the non-linear nature of the time-series data of the stock prices, mathematical modeling approaches usually fail to yield acceptable results. Therefore, machine learning methods can be a promising solution to this problem. Methods: In this...

متن کامل

Negative Selection Based Data Classification with Flexible Boundaries

One of the most important artificial immune algorithms is negative selection algorithm, which is an anomaly detection and pattern recognition technique; however, recent research has shown the successful application of this algorithm in data classification. Most of the negative selection methods consider deterministic boundaries to distinguish between self and non-self-spaces. In this paper, two...

متن کامل

Iterative Approach for Automatic Beam Angle Selection in Intensity Modulated Radiation Therapy Planning

Introduction: Beam-angle optimization (BAO) is a computationally intensive problem for a number of reasons. First, the search space of the solutions is huge, requiring enumeration of all possible beam orientation combinations. For example, when choosing 4 angles out of 36 candidate beam angles, C36 = 58905 possible combinations exist.  Second, any change in a   beam 4 config...

متن کامل

Dynamic guard zone for self-supervised learning

Ab.macl The dimension of tbe guard lOne along with its bounds for the Generalised Learning Algorithm (P"thak and Pal. 1986) is determined 1'01' optimum learning. The dimension is found to be dynamic dependlllg on the input sequence and tbe current estimates of classification parameters. Incorporation of tbis higher-order knowledge in a supervisory program improves the system performance. The pe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Pattern Recognition

دوره 23  شماره 

صفحات  -

تاریخ انتشار 1990